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Quickest Change Detection 
in Adaptive Censoring Sensor Networks 

Xiaoqiang Ren*, Karl H. Johansson^, Dawei Shi^ and Ling Shi* 


Abstract —The problem of quickest change detection with 
communication rate constraints is studied. A network of wireless 
sensors with limited computation capability monitors the envi¬ 
ronment and sends observations to a fusion center via wireless 
channels. At an unknown time instant, the distributions of 
observations at all the sensor nodes change simultaneously. Due 
to limited energy, the sensors cannot transmit at all the time 
instants. The objective is to detect the change at the fusion center 
as quickly as possible, subject to constraints on false detection and 
average communication rate between the sensors and the fusion 
center. A minimax formulation is proposed. The cumulative sum 
(CuSum) algorithm is used at the fusion center and censoring 
strategies are used at the sensor nodes. The censoring strategies, 
which are adaptive to the CuSum statistic, are fed back by 
the fusion center. The sensors only send observations that fall 
into prescribed sets to the fusion center. This CuSum adaptive 
censoring (CuSum-AC) algorithm is proved to be an equalizer 
rule and to be globally asymptotically optimal for any positive 
communication rate constraint, as the average run length to false 
alarm goes to infinity. It is also shown, by numerical examples, 
that the CuSum-AC algorithm provides a suitable trade-off 
between the detection performance and the communication rate. 

Keywords: censoring, quickest change detection, minimax, 
CuSum, asymptotically optimal, adaptive, wireless sensor net¬ 
works. 


I. Introduction 

Background and Motivations: The goal of quickest change 
detection is to detect the abrupt change in stochastic processes 
as quickly as possible subject to certain constraints on false 
detection. This problem has a wide range of applications, 
such as habitat monitoring Q, quality control engineering Q, 
computer security ||^ and cognitive radio networks Q. In the 
classical quickest change detection formulation, the decision 
maker observes a sequence of observations {Xi,..., Xk, ■ ■.}, 
the distribution of which changes at an unknown time instant 
ly. The observations before the change Xi,...,X^_i are 
independent and identically distributed (i.i.d.), and the obser¬ 
vations after the change X^,... ,Xoo are also i.i.d. but with 
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a different distribution. The change event model distinguishes 
two problem formulations: the Bayesian formulation due to 
Shiryaev |j^, Kjand the minimax formulation due to Lorden 
0 and Poliak'"^. 

The classical quickest change detection problem does not 
consider the cost of acquiring observations. It assumes that the 
decision maker can access observations at all the time instants 
freely. This is an issue for resource-limited applications, such 
as those using wireless sensor networks (WSNs). In the prob¬ 
lem of quickest change detection with WSNs, observations are 
taken by one or multiple sensors, which communicate with the 
decision maker via wireless channels 0-ini- The limited 
resources, which include limited energy for each battery- 
powered sensor and the limited communication bandwidth, 
naturally pose the constraint that the observations cannot be 
sent to the decision maker continuously. Thus, we consider the 
problem of quickest change detection with such constraints. 

Related Literatures and Contributions: Recently, there are 
several works on constrained quickest change detection with 
minimax formulations im-iiD- Two classes of characteriza¬ 
tions of the cost acquiring observations were considered: the 
cost of sampling fnj, |T 3 , |T4) and the cost of communi¬ 
cation ID, |T2), fT^ . In these works, algorithms consisting 
of stopping times and sampling/transmission schedulers were 
proposed. By transmission (sampling) schedulers, when local 
sensors send their data to a fusion center (a decision maker 
samples) is determined. The communication constraint was 
studied from the perspective of quantization as well as the 
communication rate in p3| but with general sequential de¬ 
tection settings, where an interesting point was that only one 
bit of information was sent to the fusion center whenever a 
transmission occured. 

In this paper, we only take the cost of communication into 
account. This is motivated by WSNs applications for which 
the energy cost of sampling is usually negligible compared 
with that of communication ®,G7). Thus, it is a reasonable 
assumption that the sensors can take observations at each time 
instant but with limited number of communications with the 
fusion center. Furthermore, as in fTO) , fl^ , IT?) , we do not 
consider quantization errors of the data sent from local sensors 
to the fusion center. 

The structure of the system considered in this paper is 
illustrated in Fig. [T] Observations are taken by M sensors 
and are sent to a fusion center via wireless channels. Due to 
limited energy, the remote sensors cannot transmit at all the 
time instants. To make the best use of the limited resources, 
the sensors are assumed to adopt censoring strategies ID- 
Each of the sensors samples at each time instant, but only 
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transmits informative samples. The censoring strategies are 
adaptive to the detection statistic available at the fusion center. 
When necessary, the fusion center tells the sensors about the 
censoring strategies to use via the feedback channel^ 

To deal with communication constraints, in the existing lit¬ 
erature fTO), fig, |14), 1^, CuSum-like algorithms (CuSum 
in ||12|, a variant called DE-CuSum in m), GD) are 
run locally at the sensor nodes and the detection statistic 
of the algorithm is sent to the fusion center only when it 
is above a certain threshold. In this paper, a fundamentally 
different approach is adopted: the observations instead of 
the detection statistic are censored and transmitted to the 
fusion center. Compared to running CuSum-like algorithms, 
the online computation load required at the sensor side to 
censor the observations is reduced (see Remark [TJ. Another 
advantage is that our algorithm is an equalizer rule, which 
helps reduce off-line computation complexity (see Remark!^. 
Compared with decentralized settings in pO) , p^ , p^ , 
d), where the communication is unidirectional and remote 
sensors implement the censoring strategy in an autonomous 
manner, our algorithm is centralized in the sense that the 
censoring strategies used at remote sensors are fed back by the 
fusion center. Evidently, the feedback transmission introduced 
complicates the system, although it occurs only occasionally 
and its message is rather simple. Our algorithm, however, is 
able to reduce system complexity by reducing the number 
of sensors required. More specifically, to achieve the same 
detection performance, fewer sensors are required compared 
with the decentralized counterparts. A similar idea can be 
found in | [20) . About the decentralized censoring strategies, we 
point out that to utilize the information of past observations 
available at each sensor, it is necessary to censor the detection 
statistic instead of current observations. Note that in this paper, 
we do not consider the cost of sampling, while if the sampling 
is “energy consuming” (i.e., the energy consumed by sampling 
is comparable to that by communication), then the DE-CuSum 
algorithm running locally at the sensor nodes which skips 
sampling when necessary, is desirable. 

In summary, the main contribution is that for the quickest 
change detection with communication rate constraints in the 
minimax formulation, a novel algorithm is proposed, which 
is the CuSum algorithm coupled with adaptive censoring 
strategies (CuSum-AC). The CuSum-AC algorithm is proven 
to be asymptotically optimal for any positive communication 
rate constraint and thus provides some insights into how 
one can utilize censoring strategies adaptively to achieve the 
globally asymptotic optimality. 

Organizations: The remainder of this paper is organized 
as follows. The one sensor case is studied in Sections m 


IV The mathematical formulation of the considered problem 
given in Section We present the CuSum-AC algorithm 


in Section III The main results are given in Section IV 


’The feedback transmissions may cause additional energy consumption at 
the sensor side. But one should note that in our algorithm, the feedback 
message is quite simple (see Remark [TJ and the feedback transmissions are 
usually quite few (see Remark j^. In particular, when CuSum-AC algorithm 
with N = 2 levels is used, only one bit of information is needed when 
feedback occurs. 



Eigure 1: The quickest change detection system in adaptive 
censoring sensor networks. Each sensor corresponds to the 
blue rectangle in Eig. 



Eigure 2: The quickest change detection system with a sensor 
that adopts an adaptive censoring strategy. 


Eirst we show that the CuSum-AC algorithm is an equalizer 
rule, i.e., the worst-worst case detection delay is attained 
whenever the change event happens. Then we prove that the 
CuSum-AC algorithm is globally asymptotically optimal for 
any positive communication rate constraint. We generalize 
the results obtained for the one sensor case to the multiple 
sensors scenario in Section |V] Numerical examples are given 
in Section VI to illustrate the main results. Some concluding 
remarks are given in the end. All the proofs are presented in 
Appendices. 


Notations'. N, N_| 


and 


^++ 


are the set of non¬ 


negative integers, positive integers, real numbers, non-negative 
real numbers and positive real numbers, respectively, k G N 
is the time index. 1^4 represents the indicator function that 
takes value 1 on the set A and 0 otherwise, x stands for the 
Cartesian product. Eor x G M, (x)+ = max(0,x). 


II. Problem Setup 

Eor simplicity of presentation, first we consider the one 
sensor case; see Fig. [^ Then we extend the results to the 
sensor networks scenario in Section jV] Consider the change 
detection system in Fig. [^ A sequence of observations, say 
{-^fc}fcGN+, about the monitored environment are taken locally 
at the sensor. Assume that i/ is an unknown (but not random) 
time instant when a change event takes place. The instant may 
be oo, corresponding to that the change never happens. The 
observations at the sensor before ly, {Xi ,..., Xy_i}, are i.i.d. 
with probability density function (pdf) /o, and the observations 
from ly on, X,j, 2f^-i-ij •.., are i.i.d. with pdf /i. Let denote 
the probability measure when the change happens at iz. If there 
is no change, we denote this measure by Poo. The expectation 
and Eoo are defined accordingly. 

To characterize the behavior that the sensor cannot send 
the observation Xk to the decision maker all the time, we 
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introduce a binary variable 7 fc as 


J 1, if Xk is sent to the decision maker, 
[ 0, otherwise. 


Then the information pattern available for the deci¬ 
sion maker at the time instant k is given by Ik = 
{(l, 7 i, 7 iXi),..., {fc, 7 fc, 7 feXfc)} with Iq = {0}- A random 
variable T G N+ is called a stopping time if {T = k} G 
(T(Ifc), where a{Ik) is the smallest sigma-algebra of Ik- A 
stopping time can be characterised by a stopping rule, which 
is a mechanism that decides whether or not to stop based on 
the available information. 

To make the best use of the limited communication re¬ 
sources, the censoring strategy is implemented at the sensor 
node. We consider an adaptive censoring strategy, which 
varies with the information pattern. Specihcally, the censoring 
strategy used at the sensor node at time instant k, which is 
denoted by ipk, is determined by the decision maker based 
on Ik- 1 - When ipk tpk-i, the decision maker sends ipk to 
the sensor through the feedback channel. Since the sensor is 
assumed to have no memory and can thus only access Xk at 
time k, the censoring strategy r/;fc : K i—{0,1} has the form 
as 7 fc = i/jkiXk)- The censoring policy along the horizon is 
given by T' = {-tpi ,... ,'0t)- 

The communication constraint is formulated as the limited 
communication rate before the change event happens. It de¬ 
pends on the censoring policy tk and is formalized as 


r('I') = limsup —Eq 

n—foo ^ 


^7fc|r > n 


.fc=i 


< e, 


( 2 ) 


where 0 < e < 1 is a design parameter. By adjusting e, 
a tradeoff between communication resources and detection 
performance is obtained. Note that the post-change period (i.e., 
the detection delay) is usually quite small compared with the 
pre-change period, hence we only pose the communication 
constraint before the change. The conditional expectation 
Eoo['|7’ > n] thus is considered. The asymptotic optimality 
result of the paper does not hold if the total cost is considered 
(see Remark |^. A similar criterion called pre-change trans¬ 
mission cost is considered in 1191 . 

For the detection performance of the quickest change detec¬ 
tion, there are two indices: the risk of false detection and the 
detection delay. Given T and T*, the risk of false detection 
is characterized by the average run length to false alarm 
(ARLFA) 


case detection delay ||7]|^ which is given by 
dijT, T')= sup { esssupEi,[(T - 1 / -I- l)+|I,,_i]}. (3) 

l<I/<00 Xjy—l 

Problem 1. 

minimize (T, T*), 

T,'!' 

subject to p(T, (4) 

^(4-) < e, (5) 

where Q>1 is a given lower bound of the ARLFA. 

Note that for the classical formulation of the quickest 
change detection, the observations are assumed to be i.i.d. 
conditioned on the change event. While since 4/ is adaptive, 
the available observation sequence {'^k,lkXk} are correlated 
across the time. To solve the above problems thus is quite 
challenging. 

To avoid degenerate problems, we make the following 
assumption for the remainder of this paper. 

Assumption 1. 

0<I(/i||/o) <00, 0<I(/o||/i) <00, 

where I(/i||/o) = 4/i(x) In I(/o||/i) = 

/r/ o(3:) In are the Kullback-Leibler (K-L) diver¬ 

gences. 

Our subsequent analysis utilizes the CuSum algorithm, 
which is stated as follows. Let constant a be a given threshold 
and i{Xk) = In the log-likelihood ratio function. The 

stopping time for the CuSum algorithm thus is computed as 

T{a) = infjfc : Ck > a}, (6) 

where Ck is the detection statistic for the CuSum algorithm 
computed by 

Cfe = (cfe-i + 

with Co = 0. The CuSum algorithm is optimal for original 
Lorden’s formulation when a is chosen such that Eqo [d’(a)] = 
( | [22) , | |23| . When there is no communication rate constraint, 
i.e., e = 1, Problem[2is reduced to original Lorden’s formula¬ 
tion. We should remark that it is difficult to hnd strictly optimal 
algorithms for Problem [T] We hence focus on asymptotically 
optimal solution. For simplicity, we use T(a) to denote the 
CuSum algorithm with the threshold a for the remainder of 
this paper. 


III. CuSum-AC Algorithm 


5(T, vk) = Eoo [T ]; 

cf., Q, | [2Tj . Note that the reciprocal of the ARLFA is 
connected to the false alarm rate. When the ARLFA goes to 
infinity, the false alarm rate goes to zero. The stopping time 
T is related to Ik, which is determined by the observation 
sequence {Xk} and the censoring policy T*, so the ARLFA is 
related with rk. To highlight this dependence, we use g{T, T*) 
in the above dehnition. 

For the detection delay, we consider Lorden’s worst-worst 


In this section, we present the proposed CuSum-AC algo¬ 
rithm, which is the CuSum algorithm coupled with a censoring 
policy that adaptively switches between different censoring 
strategies. We say a CuSum-AC algorithm is with N levels 
if the number of censoring strategies used is N. For ease 
of presentation, we present the CuSum-AC algorithm with 
N = 2 levels, and for the CuSum algorithm with N > 2 
levels, see ( [T2] i and Remark]^ In the remainder of this paper, 

^It is easy to see that the main result in this paper, i.e., asymptotical opti¬ 
mality of our algorithm, also holds when Poliak’s criterion is considered. 
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if not particularly indicated, the CuSum-AC algorithm refers 
to the one with N = 2 levels. The algorithm consists of three 
parts: how the detection statistic updates, what the adaptive 
censoring policy is, and when the algorithm stops and declares 
the change. 

Let a and oi be given thresholds with oi < a. The detection 
statistic Sfc are updated as follows: 


Sfe =max{ 0 ,Sfc_i+£’^'‘( 7 fc,A:fc)}, ( 7 ) 

_/ ai, if Sfe_i < Oi andsfc > oi, 

^ \ Otherwise, ^ 


with initial value sq = 0 - The quantity ( 7 ^, is the log- 
likelihood ratio function of the random variable 7 feXs 0 under 
the censoring strategy t/jk- 


]Pi{7fc = OlV’fe} 


+ In- 


’ ]Poo{7fe = OlV'fe} 
The adaptive censoring policy is given by 

if Sk-i > ai, 


(1 - Ik) 


V'fc = 


ijj*iei), if Sk-i < ai, 


(9) 


where 0 < ei < 1 and is dehned as follows. Let 

0 < e < 1 , dehne 


V’*(e) = argmaxI’^(/i||/o), (10) 

ipec(e) 

where C(e) = {ip :Poo {ip (Xk) = 1} = e}, and I’^(/i||/o) 
is the K-L divergence of the observations available at the 
decision maker under the censoring strategy ip: 

I’^(/i||/o)=Ei[^’^(7i,Xi)]. 


Among the censoring strategies that have communication 
rate e, the strategy ip*{e) has the maximal post-censoring 
K-L divergence. In general, ip*{e) does not have analytic 
expressions, but it is well known that ip*{€) has a special 
structure: the likelihood ratio of the no-send region is a single 
interval GD- The upper and lower bounds of this single 
interval is obtained via numerical simulations. 

The stopping time for the CuSum-AC algorithm is given by 


Tc{a) = infjfc : Sk > a}. (11) 

In the following, we use Tc{x, y) to denote the stopping time 
as Tc{a) for which the initial statistic is sq = the 

threshold is y. 

In summary, the CuSum-AC algorithm is illustrated in 
Fig. a A few remarks on the algorithm are presented as 
follows. 

Compared with the CuSum algorithm, two additional pa¬ 
rameters Cl, oi are introduced for the CuSum-AC algorithm. 
As in the CuSum algorithm, the parameters involved in 
the CuSum-AC algorithm, i.e., ei,ai together with a, can 
only be determined by numerical simulations. In practice, if 
the required ARLFA is large enough (i.e., the threshold a 


^Note that since a censoring strategy is adopted, when yj, = 0, the decision 
maker still has a rough information about X^- 


is chosen large enough), the communication rate constraint 
and the ARLFA constraint can be met independently. More 
specihcally, to meet the communication rate constraint, one 
may hrst hx a large enough a (any value) and then obtain an 
appropriate pair (ai,ei). Since given a pair (ei,ai), when a 
is large enough, the communication rate is almost a constant, 
one then may meet the ARLFA constraint by just adjusting a. 

The detection statistic of the CuSum-AC algorithm is re¬ 
set to switching threshold oi whenever it crosses oi from 
below. This facilitates the asymptotic optimality analysis of 
the CuSum-AC algorithm and makes the stopping time Tc{a) 
of the CuSum-AC algorithm an equalizer rule (the details of 
which are given in Section [TV]l. 

We now elaborate on the adaptive censoring strategy. Note 
that by dehnition, for a hxed 0 < e < 1 , ip*{e) is the 
most informative in the sense that it achieves the largest post- 
censoring K-L divergence with communication rate e. While 
by adjusting e, one can trade off communication cost against 
information quality of ip*{e). On the one hand, the larger e is, 
the more information of the observations taken at the sensor 
node is conveyed to the decision maker under ip*{e). On the 
other hand, if ei > 62 , ip*{ei) incurs more communication 
cost than ip*( 62 ) does. Intuitively, one tends to use a censoring 
strategy ip*{e) with a larger e when it is deemed “more 
important”. At each time k, our adaptive censoring strategy 
in (|^ tends to use a censoring strategy ip*{e) with a larger 
e when Sk-i is larger. This idea comes from the observation 
of the typical evolution of the CuSum algorithm as illustrated 
in Fig. 1^ The detection statistic goes up and down before 
it reaches the threshold. At most times, the detection statistic 
stays small. Note that if the sojourn time when Ck stays in one 
interval is large enough, the change of Ck in that interval can be 
approximated using the statistical property of the observations 
without knowing each observation. Let us take two extreme 
cases for example. Let Ti and T 2 be the sojourn time when 
Ck is in interval 1 and interval 2, respectively. Suppose that 
Ti is sufficiently large, while T 2 = 1. Then by the renewal 
theorem p4) , one knows that the change of Ck in interval 1 
can be obtained by Ai(cfe) « TiI(/i||/o). This means that 
almost no information is lost for the decision maker even if 
no messages are sent by the sensor. However, for the case 
T 2 = 1 , in order to make the change A 2 (cfc) known to the 
decision maker, this single observation has to be sent to the 
decision maker, the communication rate of which is 1. Based 
on this notion, one extends the adaptive censoring strategy to 
N > 2 levels as follows: 


ip*il), if Sk-i > ai, 
ip*{ei), if 02 < Sk-i < ai, 


ipk = < 


1p*{f-N-2)i if OJV-I < Sk-l < aN-2, 
, if Sk-i < oat-i, 


( 12 ) 


with 0 < cat-i < eN -2 < • • • < ei A 1 and oat-i < aN -2 < 

■ ■ ■ < ai- 


Remark 1. Now we discuss the practical implementation 
of the CuSum-AC algorithm with general N levels and the 
online computation load at the sensor side. The parameters. 






5 


^ Initialization 

6 

II 

0 

0 

1! 

•a 





Determine censoring strategy 
as in Equation [^. 




fc = fc + 1, 

update Sf, as in Equation |7J and 



No 

Yes 


Stop and declare the change. 


Figure 3; CuSum-AC algorithm 


i.e., 'ijj*{eN-i), ■ ■ ■ ■ ■ ■ ,cti and a, are deter¬ 

mined prior to the system run time. The censoring strategies 
'0*(e„), 1 < n < TV — 1 are stored in the sensor nod^and 
the feedback message from the fusion center is the strategy 
index n (together with n = 0 representing The 

feedback happens when ipk f "fk-i- Note that '0*(e„) has a 
special structure: the likelihood ratio of the no-send region is 
a single interval. Hence, to store a censoring strategy ip*(en), 
it suffices to store the corresponding lower and upper bounds 
of the likelihood ratio (or the observations in some special 
cases, see below). The only computation task of the remote 
sensor is to implement the censoring strategies the 

computational load of which is explained for the following 
two cases. For general distributions fi and fo, the sensor first 
computes the likelihood ratio of X}. and then compares it to 
the upper and lower bounds. Note that to run the CuSum-like 
algorithms locally, the remote sensor needs to further compute 
the logarithm of the likelihood ratio. Since comparisons have 
negligible computational load compared with that of comput¬ 
ing a logarithm, the computational load at the remote senor 
for our algorithm is much lower. If the distributions fi and fo 
are such that the likelihood ratio function is monotone, then 
a single interval of the likelihood ratio also implies a single 
interval of the observations. Then to implement f*(en), the 
sensor just needs to compare the observations directly to the 
corresponding lower and upper bounds of the observations. 
The family of distributions that have monotone likelihood ratio 
property is quite large, e.g., exponential. Binomial, Poisson 
and normal distributions with known variances. Although, in 
most of these cases, the computational load of log-likelihood 
ratio is also low— one does not need to actually compute the 
logarithm but only elementary computations are required. Still, 
compared with running CuSum-like algorithms, the computa¬ 
tional load at the remote sensor is considerably reduced in our 
algorithm, since comparisons are much simpler to compute 
than multiplications, especially when the observations take on 
real values. 


^There is no need to store t/i* (1), under which no observations are censored 
at all. 



is a mean shift in Gaussian noise, where the parameters used 
are as follows: before the change Poo : Xk A/'(0,1), after 
the change Pi : Xj. ~ A/'(0.5,1), change time ly = 60 and 
threshold a = 4.5. 

IV. Performance Analysis 

In this section, we first show that the Tc{a) is an equalizer 
rule, i.e., the detection delay dL{Tc{a),'$) is attained for any 
change time iz. We then prove Tc(a) is asymptotically optimal 
for any communication rate constraint. 

A. Supporting Definitions 

The classical performance analysis of the CuSum algorithm 
interprets the CuSum algorithm as a sequence of two-sided 
sequential probability ratio tests (SPRTs) | [25] . This technique 
is also used for our analysis of the CuSum-AC algorithm. 
Intuitively, the CuSum-AC algorithm is a sequence of two- 
sided (0 and a) SPRTs with switching modes (original or 
censored) of observations. Each time the detection statistic 
crosses oi from below, it is reset to be oi. This behavior is 
mathematically characterized as follows. 

Define a stopping time of an SPRT with a starting point 
0 < 2 :<a — tti as a variable: 

'q(z) =mf |n : z -f '^i{Xk) f. [0, a - Oi]| . 

Note that 77 ( 0 ) can be viewed as the hrst time that the detection 
statistic jumps out from [ai,a] with the initial point oi. It 
either crosses the threshold a or returns to [0, oi] and starts 
a test with censored observations. We denote by s,,( 2 ) the 
detection statistic at the time instant rj{z) bounded below by 
zero: 

/ \ ^ 

^r}{z) = I . 2 : -f ^ ^ f(X}f) -f Ui I 

Dehne a detection statistic Sk(z), which is updated in the 
same manner with that in the CuSum algorithm but with an 
initial point 0 < z < Oi and censored observations. The details 
are as follows: 

hiz) = (^h-iiz) + £^'^^^\Tk,Xk)y , (13) 

§ 0 ( 2 :) = 2 :. 

Based on Sfc(z), we define a stopping time by 
(j)(z) = inf {k : Sk(z) > Oi} . 
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As the CuSum-AC algorithm starts at 0, ^(0) can be inter¬ 
preted as the first time that it reaches ai and switches the 
observation mode from the censored one to the original one. 
Let 

«■ = ^(0) +0(S^(O)) l{S,(o)<ai}- (14) 

The CuSum-AC algorithm can be interpreted as a sequence 
of SPRTs with stopping times being of two distributions. 
Specifically, the CuSum-AC algorithm starts with the stopping 
time distributed as ^(0), and after the time instant (j){0), it is a 
sequence of SPRTs with stopping times i.i.d. distributed as $. 

B. Equalizer Rule and Asymptotic Optimality 

Theorem 1. The stopping time Tc{a) is an equalizer rule for 
Problem\I\ i.e., 

dL{Tc{a), T') = ess sup El,[(Tc(a) -v + l)+|Z„_i], Mv > 1. 

i.-i 

Remark 2. In general, the parameters used for the algorithm 
(i.e., ei,ai and a) can only be obtained by numerically 
simulating the detection performance (i.e., the delay and the 
ARLFA). The above theorem means that the change time does 
not affect the value of T'). For simplicity, we can just 

let v = 1 to simulate the delay. 

We now focus on asymptotic optimality of Tc{a). Before 
presenting the main theorem, we first present the supporting 
lemma about the communication rate of the CuSum-AC algo¬ 
rithm as follows. 

Lemma 1. Given any finite oi > 0 and 0 < e < 1, there 
exists a non-empty set £(ai,e) such that when ^*(ei) with 
Cl € £{ai,e) is used, the communication rate constraint is 
uniformly satisfied for any a > ai (including -|-oo). In other 
words, given any finite Oi > 0 and 0 < e < 1, there exists a 
censoring strategy T' as in with Cl G £{ai,e), such that 

r(T') < e, Va e (oi,-l-oo]. 

The asymptotic optimality analysis involves the scenario 
where the threshold a —oo. The above theorem enables us to 
study the asymptotic performance of the CuSum-AC algorithm 
without worrying whether the communication constraint will 
be violated for some a. 

Given oi and e, we define a set £*{ai,e) as 

£*(ai,e) =£(ai,e)n£'(oi,e), (15) 

where the set £\ai, e) is given by 


it must belong to both £(ai,e) and £'{ai,e), then f*(ai,e) 
is a non-empty set for any oi and e. 

We are now ready to present the second main theorem. 

Theorem 2. For any e > 0, when a = InC, ei G £’*(ai,e) 
with any 0 < Oi < a are used, the CuSum-AC algo¬ 
rithm satisfies the ARFLA constraint 0 and communication 
rate constraint 0. Furthermore, the CuSum-AC algorithm 
is asymptotically (( -G oo) optimal for Problem^ i.e., as 
C —> oo. 

Remark 3. The globally (for any communication rate con¬ 
straint e > 0) asymptotic optimality of the Cusum-AC al¬ 
gorithm stated in Theorem relies on the following two 
factors: (i) the asymmetric behavior of the detection statistic 
for the CuSum-AC algorithm Sk on pre-change and post¬ 
change hypotheses, (ii) the fact that the communication rate 
defined in 0 is merely on the pre-change hypothesis. On the 
one hand, under Pqo, the expected duration of Sk being above 
Oi is finite even when a is infinite (i.e., Tff defined in ( |28| l is 
finite), then one can choose ei (equivalent to ip*{ei)) to make 
the communication rate (ARLFA) of the CuSum-AC algorithm 
arbitrarily small (large). On the other hand, under Pi, the 
expected duration of Sk being below oi is finite for any ci > 0, 
while the expected duration of Sk being above Oi goes to 
infinity when a — > (X). 

Remark 4. Note that the general N > 2 levels cases provide 
more degrees of design freedom (including a^-i ,... ,ai and 
ejv-i, • • •, ei) than the two levels case (including oi and ei). 
Following exactly the same arguments for Theorem one 
concludes that the CuSum-AC algorithm with N > 2 levels 
is asymptotically fC —> oo) optimal. Though the asymptotic 
optimality can be achieved for any censoring levels, better 
detection performance should be expected when N increases 
if C takes moderate values. 

Remark 5. We remark that the feedback transmissions 
needed for the CuSum algorithm are in general quite few. 
In particular, the average number of feedback transmission 
under P^o is Eqo [Nf] = From ( |32| l, the ratio 

of the average feedback transmission times to the ARLFA is 

EoelAff] _2_ 

Eoo[T'c(a)] — Eoo[77(O)]-|-Eoo[0(s,,(o))|sr,(o)<ai]lF’oo{s,,(o)<ai}’ 

which is usually negligible since the term 
lEoo [</> (s^( 0 )) 157,(0) < «i] IPoo{s,,( 0 ) < Oi} is in general 

quite large. 


£'(ai,e) V. Extension TO Sensor Networks 

={ei > 0 : Eoo [f ( 57 ,( 0 )) |s,,(o) < at] > Eoo [7"(ai)], Va > oi}. In this section, first we modify the considered problem to 

sensor networks. Then we generalize the CuSum-AC algorithm 

Recall that T{ai) is the stopping time for the CuSum algo¬ 
rithm with ai as the threshold. Under Assumption [T] using 
the standard performance analysis technique for the CuSum 
algorithm (e.g., P.142 of p5]), one sees that Eoo [T’(ai)] is 
finite for any finite oi. Using the same analysis for £(ai,e) 

(see the proof of Lemma [^, one sees that £'{ai,e) is not 
empty for any oi and e. Furthermore, when ei is small enough. 


presented in Section III and the results obtained in Section IV 
to this case. 


A. Problem Formulation 

The system is illustrated in Fig. Let Ai = {1,...,M}. 
As in the one sensor case, it assumed that at sensor m, 
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are i.i.d. with pdf /{o,m} and 
• • ■} are i.i.d. with pdf As in 0, {2^ , 

it is assumed that the change event affects all the sensors 
simultaneously at i/ and the observations are independent 
across the sensors, conditioned on the change point. 

Like in Q, let 7{m,fc} be indicator whether or not the 
sensor m sends its observation X^m,k} to the fusion center. 
Let be the censoring strategy used at the sensor m at 

the time instant k, i.e., 


Let = {V'fi.fc}) ■ • ■) V'fM.fc}} be the censoring strategies 
used at all the sensor nodes at time k and be the censoring 
policy along the horizon, i.e., = {tp ^,..., ip^}. 

The average communication rate before the change event 
happens for the network is defined by 


r(vp^) = limsup —ttE, 
^ ' n^oo nM 


M 




,k—l m—1 


■ (16) 


In 1141, the authors posed communication rate constraint for 


each channel in the multi-channel setting (the affected subset 
of the sensors is unknown). Since the change event affects 
all the sensors simultaneously in our case, we instead use the 
average communication rate of the whole network ( fTh] ). Given 
T and 4/^, the ARLFA is defined in the same way as in the 
one sensor case, i.e.. 


p(r,4-^)=Eoo [T]. 


where the detection statistic is updated as follows: 

M 

=max{0,s^i + ^ ^{m.fc})}, 

m—1 

M _ f Oi) if < “1 and > Oi, 

^ 1 otherwise, 


and the censoring strategies are given by 

r if s^i>ai, 

1 ^*(e{m,i}), otherwise, 


( 20 ) 


with 0 < e{m,i} ^ 1- Note that the censoring strategies used 
at all the sensor nodes are switched simultaneously, which are 
adaptive to the detection statistic available at the fusion center. 
This helps reduce the times of feedback and the feedback 
message can be broadcasted to all the sensor nodes by the 
fusion center. 

Let and ef = 

{e{i 4 },..., e{M,i}}- Note that X^ and ip-jp^ can be regarded 
as the “vector version” of Xk and ipk in one sensor case, 
respectively. The CuSum-AC algorithm for the multiple sen¬ 
sors case is equivalent to its counterpart in one sensor case 
working with X^ and ’ip'^. Thus the following theorems are 
straightforward. 


Theorem 3. The stopping time T^{a) is an equalizer rule 
for Problem^ i.e., for any iz > 1, 

= esssupE,[(T,^(a) - + l)+|I;^i]. 


Let 


{ (f: 7{m,l} j 7{m,l}2f{m,l} )i • • ■ : 

(/c, 7{m,fc} 5 ^{m,k}X^jYi,k} ) } 

and = {Z^,... ,Z^}. Then the Lorden’s detection delay 
is defined by 

dL(T, 4'^)= sup { esssupEi,[(r - -I-l)+|Z;^;i]}. 

1<1/<00 

Then the problem we are interested in is as follow: 

Problem 2. 


Define S'^{e,ai) as the counterpart of Z*(ai,e). To see 
that<e'(e) is not empty, one can pose an additional constraint 
that £{ 1 , 1 } = ••• = e{M,i}- Then the arguments follow 
straightforwardly from that of Z*(ai,e). 


Theorem 4. For any e > 0, when a = ln(^, € ^^(e, oi) 

with any 0 < oi < a are used, the CuSum-AC algorithm 
satisfies the ARFLA constraint © and communication rate 
constraint •HI- Furthermore, the CuSum-AC algorithm is 
asymptotically optimal for Problem^ i.e., as ^ oo. 


(In 0,4/^) 


In C 

I(/{ l,m} I l/{ 0 ,m} ) 


( 1 + 0 ( 1 )). 


minimize d^lT, ), 

subject to p(T, 4'^)>^, (17) 

r(4'^) < e, (18) 

where (p> 1 is a given lower bound of the ARLFA. 

B. CuSum-AC Algorithm for Multiple Sensors Case 

We only present and focus on the CuSum-AC algorithm 
with N = 2 levels for sensor networks; N > 2 levels can be 
generalized as in the one sensor case. Let a and oi be two 
thresholds. The stopping time is computed as 

T,^(a)=inf{A::sf >a}, (19) 


VI. Numerical Examples 

For simulations, we consider the problem of mean shift 
detection in Gaussian noise. It is assumed that M = 3 identical 
sensors are deployed and the pre-change and post-change 
distributions are /{o.m} -^(0,1) and A/’(0.5,1), 

respectively. For simplicity, in each example, the sensors use 
an identical censoring strategy. 

Example 1. The asymptotic optimality of the CuSum-AC 
algorithm is examined. We compare the detection performance 
of the CuSum-AC algorithm with that of the CuSum algorithm 
(which is the optimal one when there is no communication 
rate constraint). With different ARLFA’s (sufficiently large), 
the detection delays (i.e., Ei [T]) of these algorithms are 
simulated. For the CuSum-AC algorithm, two communication 
rate constraints are considered, i.e., e = 0.7 or e = 0.4. Note 







Figure 5; Detection delay versus the ARLFA for the CuSum 
algorithm and the CuSum-AC algorithm. 

that given a communication rate and ARLFA constraint, there 
may exist multiple admissible combinations of the parameters 
(i.e., a, ai,ei). To alleviate the computation burden, we set 
ei = 0.63 and ai = 0.78 for the case e = 0.7 and 
ei = 0.27 and oi = 0.79 for the case e = 0.4. The value 
of the threshold a is determined to make the communication 
rate constraint to be satished equally. Since given oi and 
ei, the communication rate is not strictly monotonic with a, 
multiple a’s (which have different ARLFA’s) can be found. 
In fact, given ai and ei, the communication rate remains the 
same when a varies if a is sufficiently large. The simulation 
results are given in Fig. It can be seen that as the ARLFA 
increases, the difference between the delay of the CuSum-AC 
algorithm (with communication rate either e = 0.7 or e = 0.4) 
remains approximately constant. This verihes the asymptotic 
optimality, since the difference will be negligible when the 
ARLFA goes to inhnity. Furthermore, we can see that for the 
CuSum-AC algorithm with the same ARLFA, the one which 
has the smaller communication rate (i.e., e = 0.4) has the 
larger detection delay. This is consistent with our intuition 
that better detection performance can be expected when more 
communication resources are used. We also note that when 
the communication rate is 0.4, the delay of the CuSum-AC 
algorithm is only around 1.2 time slots larger than that of 
the CuSum algorithm (the communication rate of which is 1). 
This shows good detection delay versus communication rate 
trade-off for the CuSum-AC algorithm, which will be shown 
further in the next example. 

Example 2. We compare the CuSum-AC algorithm with 
the CuSum algorithm with the random transmission policy. 
In the random transmission policy, whether an observation is 
transmitted or not is determined randomly such that the com¬ 
munication rate is satished. The random transmission policy 
is quite simple and serves as a lower bound of the detection 
performance in some sense. We plot Fig. in the following 
way. The ARLFA is hxed to be 10 000, i.e., the parameters for 
the algorithm (a for the random transmission control policy, 
and a, ai and ei for the CuSum-AC algorithm) should be 
chosen such that the associated ARLFA is around 10 000. The 
parameters for the CuSum-AC algorithm are determined using 
the brute-force search technique. Multiple admissible combi¬ 
nations of the parameters a, ai and ei exist, among which the 
one that has the smallest detection delay is used. As depicted 



communication rate 


Figure 6; Detection delay versus the communication rate for 
the CuSum-AC algorithm and the CuSum algorithm with 
random transmission policy. 

in Fig. 1^ the CuSum-AC algorithm signihcantly outperforms 
the CuSum algorithm with the random transmission policy, 
in particular when the allowed communication rate is small. 
One also should note that the CuSum-AC algorithm has quite 
nice performance per se. In particular, when e = 0.1, the 
detection delay of the CuSum-AC algorithm is only 7 time 
slots larger than that of the CuSum algorithm (when there is 
no communication rate constraint, the CuSum-AC algorithm 
reduces to the CuSum algorithm). 

VII. Conclusion and Future Work 

In this paper, we have studied the problem of minimax 
quickest change detection with communication rate con¬ 
straints. The constraint is posed by limited energy at the remote 
sensors. An extension of the classical Lorden’s formulation 
was studied. We proposed the CuSum-AC algorithm; the 
CuSum algorithm is used at the fusion center and adaptive 
censoring strategies are used at the sensor nodes. The CuSum- 
AC algorithm was proved to be an equalizer rule, and be 
globally asymptotically optimal for any positive communi¬ 
cation rate constraint, as the ARLFA goes to inhnity. The 
numerical simulations showed that the CuSum-AC algorithm 
has a better detection performance versus communication rate 
trade-off than the CuSum algorithm with random transmission 
control policy. 

For the future work, there are two interesting directions. 
One is to explore the relationship between the detection 
performance when the ARLFA takes moderate values and the 
censoring strategy being used (in particular, to hnd whether 
there exists strictly optimal censoring strategy for every pos¬ 
sible ARLFA). The other one is to study the problem in the 
multi-channel setting (the change event only affects a subset 
of the sensors). 

Appendix A 
Proof oe Theorem[T] 

Before proceeding, we hrst give a supporting lemma as 
follows. Recall that the stopping time Tc{z, a) can be viewed 
as the hrst time that the CuSum-AC algorithm reaches the 
threshold a, when starting at z. About Tc{z,a), the follows 
hold. 
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Lemma 2. For any a and 0 < z < a, 

El [T,(z, a)] < El [T,(a)]. (21) 

Proof: The proof is done by cases. 

Case 0 < z < oi.' Because of the reset action of the CuSum- 
AC algorithm when it crosses ai from below, for 0 < z < ai, 
we have 

El [Tc(z, o)] = El [(j){z)] + El [Tc(ai, a)]. (22) 

The quantity Ei [TcCoi)®)] is a common term. Thus to obtain 
Lemma 1^ we only need to prove that for any 0 < z < oi 

El [f{z)] < El [0(0)], (23) 

which is easily obtained by sample path arguments. 

Case oi < z < a: Starting at z, there are two possible 
ways for the CuSum-AC algorithm to reach the threshold 
a eventually. One is that the algorithm never returns to the 
censored region before it stops, i.e., Sk > ai along the 
whole horizon; we denote this event by The other one 
is that the detection statistic Sk once crosses oi from up 
before the algorithm stops, which is denoted by Z^. Let 
p{z) = Pi {Z^} ■ We then have 

El [Tc{z, a)] 

= p(z)Ei [Tc{z,a)\zl] + (1 -p(z)) El [Tc{z,a)\Z^] 

= p{z)Ei [Tc{z,a)\Zl] 

+ {^-P{z)) (El [r]{z)\zj] +Ei [Tc{x,a);Sr,(^) = x\zj]) 
:=p(z)t^ + (1 - Piz)) (4 + 4) ■ 

The physical meaning of tf is the conditional average time it 
takes for the CuSum-AC algorithm to cross oi from up for 
the first time, when starting at z. 

Before proceeding, we first define a stopping time T{a) as 
follows. This is a stopping time for a variant of the CuSum 
algorithm that works in the same manner with the CuSum 
algorithm but starts at oi and bounded below by oi: 

T(a) = inf{fc : Ck > a}, 

where Ck involves by 

Ck = max(cfe_i +£{Xk),ai ), 

Co = Oi. 

For the quantities p{z), and t\, from Lemma 5 of | [l3) , we 
then have an inequality: for any t* > Ei[T(a)], 

p(z)f^ + (1-p(z)) (4+ f*) < C. (24) 

The reset action for the CuSum-AC algorithm when crossing 
oi from below together with the fact that {Xk}'s are i.i.d. 
under the measure Pi yields 

El [Tc(ai,a)] > Ei[f'(a)]. 

Combining ( |2^ , one obtains 

Ei[r,(a)] >Ei[r(a)]. (25) 


We then study the quantity 4.. 


4 =Ei [Tc{x 

, a) , 

ixi 

II 



El \Tc{x. 

’ 1 1 ^r]{z) ■ 

= x] dPi{ 

^ 0<ai<ai 




(ft) f 

El [Tc{x, 

a)]dPi{s^(^) 

< 

J 0<a:<ai 




(iel) 





< esssupEi [Tc(x, a)] 

0<rE<ai 

<Ei [T,(a)]. (26) 

The equality (el) follows from the Markovity of the detection 
statistic for the CuSum-AC algorithm. Thus, given = 
X, from the time instant k = r]{z) on, the evolution of the 
CuSum-AC algorithm, which starts at z, is exactly the same 
with that of a new CuSum-AC algorithm that starts at x. The 
inequality (iel) holds because of Holder’s inequality. 

Then for any ui < z < a, 

El [Tc(z, a)] =p(z)f^ + (1 - P(z)) (t\ + 4) 

(fel) / s. 

< p(z)f^ + (1 - p{z)) (^4 + El [rc(a)] j 

(ie2) 

< El [Tc(a)], 

where the inequality (iel) follows from ( |26] l and (ie2) follows 
from ( |24l i and ( |25] l. The proof thus is completed. ■ 

We are ready to prove Theorem From the Markovity of 
the detection statistic Sk for the CuSum-AC algorithm, one 
can see that the average detection delay is measurable with 
respect to s,y_i, i.e., 

E,[(T,(a) -iy + l)+|X,_i] =E,[(T,(a) - + l)+|s,_i] 

=E^[(Tc(si,_i,a)]. 

Note that under Assumption [T] for any censoring strategy 
'0*(ei), from the positive definiteness of K-L divergence p7) , 
one has Eoo [i'^ Xk)] < 0. It then follows that 

<0} :=p>0 . 

and for any iz > 1 

Poo{s .-1 = 0} > > 0. (27) 

We then obtain 

esssupEi,[(rc(a) -v + l)+|X^_i] 

i.-i 

= esssupEj,[Tc(s;,_i, a)] 

Su-l 

= E^[Tc(si,_i = 0,a)], 

where the last equality follows from Lemma [^ and ( |27l l. The 
proof thus is completed. 

Appendix B 
Proof of Lemma[T] 

We first give the upper bound of r(T') when a,ai and 
'0*(ei) are known. We then show that by choosing certain 
ei (equivalently with 0*(ei)), which is independent of a, the 
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upper bound can be any admissible value. The technique used 
is to interpret the CuSum-AC algorithm as a sequence of two- 
sided SPRTs as in Section HV-AI 
Define a random sequence {Ti} as 

To =m, 

Ti =Ti-i + Wi, Vj > 1, 

where Wi := wj;^'' + W^^^\ The random variables 

and are i.i.d. distributed with mean equal to 

Eoo [t?(0)|s^(o) < ai] andEoo ['/'(s,,(o)) |sr,(o) < oi], respec¬ 
tively. Note that the distribution of Wi is different from that 
of $ defined in ( |T4l i. The stopping time $ is for the evolution 
of the CuSum-AC algorithm (which may stop at some time, 
i.e., s^(o) > a), while for the definition of the communication 
rate constraint we implicitly assume that the CuSum-AC 
algorithm never stops. 

Based on {ff}, we define a reward sequence {i?i} as 


Rq —0, 

R, =R,-i + W^^^^ + Vz > 1, 


( 2'\ 

where Ji ^ B{Wl ',€i) is a binomial distributed random 
variable. 

Given ei,ai and a. 


(iel) 1 

r(T') < limsup-;prEo 

n—xx) ^ -^0 


.fe=To + l 


T > n 


= lim 


Ri 


i->oo Ti — Tn 


E 

(el) 


W^^^ + Ji 


Eoo [w^i] 

Eoo [t?( 0 ) 157 ^( 0 ) ^ til] -f CiEoo (^r;( 0 )) 1^77(0) ^ til] 


Eoo [??(0)|s^(o) < ai] -f Eoo [(A (S7,(0)) |Sr,(0) < til] 

:= f(T'), 


where the inequality (lel) holds because by the definition of 
<p{0), before the time instant Tq, all the observations are cen¬ 
sored using the censoring strategy ij}*{ei), the communication 
rate of which is ei (the lower bound of r(T')); the equality 
(el) follows from the alternating renewal process theory (Page 
173, 1^). 

Using the sample path arguments, one sees that, given ai, 
Eoo [? 7 ( 0 )|s 77 (o) < ai] is monotonically nondecreasing with a. 
Let 


For any censoring strategy Eoo Xk)] < 0. 

Furthermore, as ei —>• 0, Eoo Xk)] 0 

Note that when ei = 0, not only the mean of the log- 
likelihood ratio but also the random variable jkXk reduces 
to constant 0. By Corollary 2.6 of p4| , one sees that as 
Eoo Tffc)] 0, Eoo [()i' (2)] CO for any 0 < 

z < ai- Note also that because of the reset action whenever 
the CuSum-AC algorithm crosses ai from below, is 
only related with the distribution of under Poo, which 
is independent of the censoring strategy V’*(ci) (i-S-, ei)- 
Given any e, one thus can find a non-empty set £{ai,e) 
that is independent of a (i.e., independent of the distribution 
of {3^(0)1577(0) < '^ 1 }) such that for any censoring strategy 
'0*(ei) with ei S £{ai,e), f(T') < e. The proof thus is 
completed. 


Appendix C 
Proof of Theorem|2] 

By the definition of £*{ai, e), the CuSum algorithm satisfies 
the communication rate constraint 0. Note that the CuSum 
algorithm is strictly optimal for Problem [T] when e = 1. We 
prove Theorem]^ by relating the CuSum-AC algorithm to the 
CuSum algorithm. 

Step 1. We first prove that the ARLFA of the CuSum- 
AC algorithm that uses the censoring strategy defined in 
Theorem]^ is always larger than that of the CuSum algorithm. 
This is due to the definition of £*{ai, e) in ( [T5| ), by which ei 
(equivalent to ijj*i^i)) is appropriately chosen. Note that the 
ARLFA can be made arbitrarily large by adjusting ei. To this 
end, we define a stopping time as follows: 

T(a) = infjfc : Ck > a}, 


where the detection statistic Ck evolves by 
Cfc = (cfc-i + ('(Tffc))"'’, 

oij if c{, > ai and Cfe_i < oi, 
Ck = { 0, if c{, < ai andcfe_i > oi, 


-'ki 


Otherwise, 


Co = 0. 


The difference between Ck and Ck in 0 for the CuSum 
algorithm is that Ck is reset to be oi (0) whenever it crosses 
oi from below (up). Using the sample path arguments, one 
can see that for any a. 


Eoo 


f(a) 


> Eoo [r(a)] • 


(29) 


[7(0)|s77(o) < ai] . (28) 

Since Eoo [f(2ffc)] < 0 (by Assumption 0, Corollary 2.4 of 
yields that is hnite. Note that f(T') is monotonically 
nondecreasing with Eoo [? 7 ( 0 )|s,,(o) < oi], then for any a. 


'T^OC 

f(T') < 


'T'oo 

ai 


' ciEoo {577(0)) 1577(0) ^ til] 
F Eoo \_tj^ (577(0)) 1^77(0) ^ til] 

T°° 

+ ei 


®^cx5[<i*(^T7(0))l^?7(0)^^l] 
Eoo [<(>(s77{0) ) p77(0) < 0-1 

:=r(T'). 


+ 1 


Let N(k) be the number of Ck crossing oi from below by 
time instant k, i.e. 


N{k) = 
with N{0) 


N{k — 1) -F 1, if Ck > Oi and Ck-i < Oi, 
N{k—\), otherwise, 

0. Wald’s identity yields 


Eo 


f(a) 


= (Eoo [i?(0)] -F T(ai)Poo{s7,(o) < oi}) Eo 



^This approach is not necessarily monotonic. In some cases, 
Eoo can be zero for non-zero ei’s. 
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Let T^{a) = inf{fc : > a}, where is evolved in 

the same manner with Sk in ([^ for the stopping time Tc{a) 
except for the starting point. The detection statistic Sk starts 
at So = 0, while s^ starts with a random variable Sq = x 
and the distribution of x is the same with that of the random 
variable {p(0)|s^(o) < ai}. Using the sample path arguments, 
one sees that 


Eoo[Te(a)] >Eoo[r,^(a)]. (31) 

Let N^{k) be the number of s^ crossing oi from below by 
time instant k, which is defined in the same manner with N{k). 
Also by Wald’s identity, one has 

Eoo [T:{a)] 

— (Eoo [^(0)] + Eoo (®'l(0)) l'®i)(0) ^ til] Eoo{Sr;(0) ^ til}) 
Eoo [N<> (T^ia))] . (32) 


Both Cfc and s^ are reset to be oi when they cross oi from 
below, we thus obtain the following; 


Eoo 



=Eoo [N^ (T^ia))] 

1 

1 Eool^T^fO) ^ til} 


(33) 


where the last equality follows because both N{k) and N'^{k) 
are geometrically distributed. 

Combining ( [T5] l, ( [29] l, ( |30l l, ( [3T] i, ( |32| ) and ( [3^ , one can 
see that for any e, when the censoring strategy '0*(ei) with 
ei S £*{ai,€) is used. 


Since I(/i||/o) > 0, by Corollary 2.4 of |24|, one obtains 
that as a —>■ oo, Ei [?7(0)] —?► oo. Note that for any ei > 0, 
iV" ('^i)(/i||/o) > 0. From the established performance analy¬ 
sis technique for the CuSum algorithm (e.g., P. 142 of |[25j), 
one sees that Ei [<5ii(0)] < oo. Note that Ei [(/)(0)] is only 
related to oi and ^p*{el). By the definition of £*{ai, e), ijj*{ei) 
is independent of a. The following thus can be obtained: 


Ei[r2] ^ Ei[<^(0)] 


Ei[ri] 

Then a —>■ oo, 


< 


El ivm 


0 , 


as a —oo. 


Ei[r,(a)]=Ei[ri]((l + o(l))). (35) 


Because of the reset action when Sk crosses oi from below, 
the following holds: 


El[Ti] <Ei [T{a — oi)] 

ln(a — Oi) , ... 

= I(/i||/o)^ ^ 

Note that as a —>■ oo 

Combining ( [35] l, ( [3^ and ([37]l, one obtains that for any e, as 
a —■ oo. 


El [r,(a)] < El [r(a)] (1 + o(l)) 

El [r(a)] (1 + 0(1)), (38) 


Eoo [Tc{a)] > Eoo [T{a)]. (34) 

Then by the established performance results of the CuSum 
algorithm p5) , the CuSum-AC algorithm satisfies the ARLFA 
constraint Q( 

Step 2. We show that for any e, as a —oo, 

El [T,{a)] = El [r(o)] (1 + o(l)). 

The intuition is as follows. Since for any finite oi and 
ei > 0, the duration of Sk staying below oi (when censored 
observations are used) is finite, then as a —oo, the duration 
of Sk being above Sk (when raw observations) dominates. The 
asymptotic first-order behavior of the detection delay of the 
CuSum-AC algorithms thus resembles that of classical CuSum 
algorithm. 

Let 

Tc(a) 

^1 ^ 'y ^ l{sfc>ai}) 
k=l 

T 2 =Teia) - Ti. 


Recall that Sk is the detection statistic for the CuSum-AC 
algorithm. The quantity Ti (T 2 ) can be viewed as the duration 
that Sk stays above (below) oi. Following similar arguments 
in Step 1, one obtains that 




where (el) holds because given ( |34l i, Ei [r(a)] is the lower 
bound of El [Tc(a)] as a —>■ 00 . 

Step 3. By Theorem [T] 

dL(T,(a), 4-)= El [r,(a)]. 

Note that T{a) is asymptotically optimal for Problem as 
C, ^ 00 (i.e., a —> 00 ), when e = 1. Combining ( |3^ and p8|l, 
one obtains that the CuSum-AC algorithm is asymptotically 
optimal. The proof thus is complete. 
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